Corpus-based Question Answering for why-Questions
نویسندگان
چکیده
This paper proposes a corpus-based approach for answering why-questions. Conventional systems use hand-crafted patterns to extract and evaluate answer candidates. However, such hand-crafted patterns are likely to have low coverage of causal expressions, and it is also difficult to assign suitable weights to the patterns by hand. In our approach, causal expressions are automatically collected from corpora tagged with semantic relations. From the collected expressions, features are created to train an answer candidate ranker that maximizes the QA performance with regards to the corpus of why-questions and answers. NAZEQA, a Japanese why-QA system based on our approach, clearly outperforms a baseline that uses hand-crafted patterns with a Mean Reciprocal Rank (top-5) of 0.305, making it presumably the best-performing fully implemented why-QA system.
منابع مشابه
ارایه یک پیکره پرسش و پاسخ مذهبی در زبان فارسی
Question answering system is a field in natural language processing and information retrieval noticed by researchers in these decades. Due to a growing interest in this field of research, the need to have appropriate data sources is perceived. Most researches about developing question answering corpus area have been done in English so far, but in other languages as Persian, the lack of these co...
متن کاملNTT's Question Answering System for NTCIR-6 QAC-4
NTCIR-6 QAC-4 organizers announced that there would be no restriction (such as factoid) on QAC4 questions, but they plan to include many ‘definition’ questions and ‘why’ questions. Therefore, we focused on these two question types. For ‘definition’ questions, we used a simple pattern-based approach. For ‘why’ questions, hand-crafted rules were used in previous work for answer candidate extracti...
متن کاملCollecting a Why-Question Corpus for Development and Evaluation of an Automatic QA-System
Question answering research has only recently started to spread from short factoid questions to more complex ones. One significant challenge is the evaluation: manual evaluation is a difficult, time-consuming process and not applicable within efficient development of systems. Automatic evaluation requires a corpus of questions and answers, a definition of what is a correct answer, and a way to ...
متن کاملEvaluating Answer Extraction for Why-QA using RST-annotated Wikipedia texts
In this paper the research focus is on the task of answer extraction for why-questions. As opposed to techniques for factoid QA, finding answers to whyquestions involves exploiting text structure. Therefore, we approach the answer extraction problem as a discourse analysis task, using Rhetorical Structure Theory (RST) as framework. We evaluated this method using a set of why-questions that have...
متن کاملQuestion answering with discriminative learning algorithms
In this paper, we describe a discriminative learning approach for question answering. Our training corpus consists of 2 million Frequently Asked Questions (FAQs) and their corresponding answers that we mined from the World Wide Web. This corpus is used to train the lexical and semantic association model between questions and answers. We evaluate our approach on two question answering tasks: 200...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008